A semi-automatic approach for speaker mining of tapped telephone conversations
نویسندگان
چکیده
Speaker mining involves speaker detection in a set of multispeaker files. In previous work on speaker mining, training data is used for constructing target speaker models. In this study, a new speaker mining scenario was considered, where there is no demarcation between training and testing data and prior target speaker models are absent. Given the ENRON database which consists of tapped telephone conversations between traders and customers, the task is to identify conversations having one or more speakers in common. Since the poor audio quality of this database makes automatic speaker segmentation ineffective, a new technique was developed where a multi-speaker model is trained on the entire conversation and various scoring strategies were tried. A semi-automatic approach was adopted and it reduces the manual effort involved in speaker mining by 68%.
منابع مشابه
Robust Voice Mining Techniques for Telephone Conversations
Title of thesis: ROBUST VOICE MINING TECHNIQUES FOR TELEPHONE CONVERSATIONS Sandeep Manocha, Master of Science, 2006 Thesis directed by: Dr. Carol Y. Espy-Wilson Department of Electrical Engineering Voice mining involves speaker detection in a set of multi-speaker files. In published work, training data is used for constructing target speaker models. In this study, a new voice mining scenario w...
متن کاملClustering speakers by their voices
The problem of clustering speakers by their voices is addressed. With the mushrooming of available speech data from television broadcasts to voice mail, automatic systems for archive retrieval, organizing and labeling by speaker are necessary. Clustering conversations by speaker is a solution to all three of the above tasks. Another application for speaker clustering is to group utterances toge...
متن کاملAutomatic speaker clustering from multi-speaker utterances
Blind clustering of multi-person utterances by speaker is complicated by the fact that each utterance has at least two talkers. In the case of a two-person conversation, one can simply split each conversation into its respective speaker halves, but this introduces error which ultimately hurts clustering. We propose a clustering algorithm which is capable of associating each conversation with tw...
متن کاملAn Avatar-Based System for Identifying Individuals Likely to Develop Dementia
This paper presents work on developing an automatic dementia screening test based on patients' ability to interact and communicate a highly cognitively demanding process where early signs of dementia can often be detected. Such a test would help general practitioners, with no specialist knowledge, make better diagnostic decisions as current tests lack specificity and sensitivity. We investigate...
متن کاملDetection of a third speaker in telephone conversations
Differentiating speakers participating in telephone conversations is a challenging task in speech processing because only short consecutive utterances can be examined for each speaker. Research has shown that, given only brief utterances (1 second or less), humans can recognize speakers with an accuracy of about 54% on average. The task becomes even more challenging when no information about th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007